Application of unsupervised analysis techniques to lung cancer patient data

نویسندگان

  • Chip M Lynch
  • Victor H van Berkel
  • Hermann B Frieboes
چکیده

This study applies unsupervised machine learning techniques for classification and clustering to a collection of descriptive variables from 10,442 lung cancer patient records in the Surveillance, Epidemiology, and End Results (SEER) program database. The goal is to automatically classify lung cancer patients into groups based on clinically measurable disease-specific variables in order to estimate survival. Variables selected as inputs for machine learning include Number of Primaries, Age, Grade, Tumor Size, Stage, and TNM, which are numeric or can readily be converted to numeric type. Minimal up-front processing of the data enables exploring the out-of-the-box capabilities of established unsupervised learning techniques, with little human intervention through the entire process. The output of the techniques is used to predict survival time, with the efficacy of the prediction representing a proxy for the usefulness of the classification. A basic single variable linear regression against each unsupervised output is applied, and the associated Root Mean Squared Error (RMSE) value is calculated as a metric to compare between the outputs. The results show that self-ordering maps exhibit the best performance, while k-Means performs the best of the simpler classification techniques. Predicting against the full data set, it is found that their respective RMSE values (15.591 for self-ordering maps and 16.193 for k-Means) are comparable to supervised regression techniques, such as Gradient Boosting Machine (RMSE of 15.048). We conclude that unsupervised data analysis techniques may be of use to classify patients by defining the classes as effective proxies for survival prediction.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extraction and 3D Segmentation of Tumors-Based Unsupervised Clustering Techniques in Medical Images

Introduction The diagnosis and separation of cancerous tumors in medical images require accuracy, experience, and time, and it has always posed itself as a major challenge to the radiologists and physicians. Materials and Methods We Received 290 medical images composed of 120 mammographic images, LJPEG format, scanned in gray-scale with 50 microns size, 110 MRI images including of T1-Wighted, T...

متن کامل

Sentinel Node Mapping in Non-small Cell Lung Cancer Using an Intraoperative Radiotracer Technique

 Objective(s): Lymph node metastases are the most significant prognostic factor in localized non-small cell lung cancer (NSCLC). Identification of the first nodal drainage site (sentinel node) may improve detection of metastatic nodes. Extended surgeries, such as lobectomy or pneumonectomy with lymph node dissection, are among the therapeutic options of higher acceptab...

متن کامل

Comparing different techniques of Post Axillary field in Breast Cancer Treatment

As we know breast cancer is the second death reason in Iran. One step of treatment process is radiotherapy, which needs careful consideration of contouring and therapeutic techniques Lung, thyroid, spinal cord, trachea and humerus are sensitive organs in breast cancer radiation therapy.  The most clinical studies recommended two ways for delivering 95 percent of dose to supraclavicular and ...

متن کامل

Analysis of the Results of Pulmonary Resection by Minimally Invasive Thoracoscopy for the Surgical Treatment of Lung Cancer

Introduction: Lung cancer is the disease of modern era, and the rate of lung cancer mortality is three times as high as that for prostate cancer and twice as high as the rate for breast cancer. We aimed to analyze the results of pulmonary resection in patients with NSCLC by minimally invasive thoracoscopy.  Materials and Methods: We studied 10 patients with NSCLC scheduled for surgical resectio...

متن کامل

Influence of different treatment planning techniques on radiation doses to the heart, left anterior descending coronary artery and left lung in the radiotherapy of left-sided breast cancer patients

Background: Breast-conserving surgery (BCS) followed by radiotherapy (RT) is the standard of care for women with breast cancer. Evidence shows that RT dose to the heart can result in ischemic heart disease. In this study we compared 3 different RT techniques were for heart, left anterior descending coronary artery (LAD) and lung doses in left breast cancer patients after breast-conserving surge...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2017